Abstract. The need to build integrated navigation complexes (INC) is substantiated. It is proposed
to include inertial, satellite and visual navigation systems in the complex, which increases the accuracy of
determining the coordinates of unmanned aerial vehicles. It is shown that in unfavorable cases, namely jamming of the
satellite navigation system by external interference, growth of the errors of the inertial navigation system (INS),
including those caused by accelerometers and gyroscopes manufactured using MEMS technology, and bad weather
conditions that complicate the work of the visual navigation system, interference (noise) must be suppressed to keep
the navigation complex operational. To improve the accuracy of the INS, which is part of the INC, it is proposed to
extract the noise from the raw INS signal, predict it using neural networks and suppress it. Two approaches are
proposed for this problem: the first is based on a multi-row GMDH algorithm and single-layer networks with
sigm_piecewise neurons, and the second on hybrid recurrent neural networks that combine long short-term memory
(LSTM) and gated recurrent units (GRU). Various types of noise inherent in video images in visual navigation systems
are considered: Gaussian noise, salt-and-pepper noise, Poisson noise, shot noise and blind noise. Particular attention
is paid to blind noise. To improve the accuracy of the visual navigation system, it is proposed to use hybrid
convolutional neural networks.


Keywords: integrated navigation complex, hybrid recurrent neural network, hybrid convolutional neural
network.


Introduction 
The requirements for the accuracy and reliability of navigation support are characterized by
the following indicators: accuracy, readiness (the probability that the complex is operable),
integrity (the probability of detecting a failure within a given time) and reliability (the
probability of maintaining the required accuracy of the complex over specified time intervals).
In most cases, the reliability and quality of navigation support depend on the redundancy
factor [1-3].

The general trend in the market for navigation systems for mobile objects is that, under
increasingly stringent requirements, developers are moving towards deeper integration of
inertial, satellite and other systems. The ICAO Committee on Advanced Navigation Systems
(FANS, Future Air Navigation System) recommends using an onboard satellite navigation system
(SNS) in mandatory combination with an inertial navigation system (INS) as the central link of
the navigation complex. The reasons are as follows. An INS provides complete information on the
navigation parameters of motion: heading, pitch (trim) and roll angles, acceleration, speed and
the coordinates of the object. It is also completely autonomous, i.e. it requires no external
information. Because it determines the angular position of an object with high accuracy over
any range of angles and at a high output rate, the INS has no alternative to date. An SNS is an
all-weather navigation system with global coverage whose positioning error does not grow over
time, and it provides precise navigation solutions at low cost, since an inexpensive handheld
chip receiver is sufficient for common applications.

However, when satellite visibility is limited, or when information from the satellite
navigation system is lost, for example because of difficult radio conditions for signal
reception, there is not enough initial data to obtain an acceptable navigation solution. In
this case there is a threat of losing the information support for the flight of the unmanned
aerial vehicle (UAV) and, as a consequence, of losing the UAV itself, because a coarse
strapdown inertial navigation system (SINS) cannot provide the UAV with flight and navigation
information of the required accuracy even over short intervals.

The integrated system then turns into an autonomous inertial navigation system that provides
the navigation solution. However, without external aiding, the positioning errors of the
inertial navigation system (INS) grow quickly because of the noise contained in the raw data of
the inertial measuring unit. In particular, a micromechanical (MEMS) inertial measuring unit
exhibits more complex errors because of its manufacturing technology.

Thus, none of the above systems alone can provide a high-quality solution to the navigation
problem. It is therefore advisable to build an integrated navigation complex (INC) that
includes navigation systems operating on different physical principles, which makes it possible
to increase the reliability of navigation support for UAV flight in both the basic and the
autonomous modes of operation. The reliability (credibility) of navigation support can be
increased by making maximum use, including alternative use, of the capabilities of the existing
information support, as well as through new approaches to integrating flight and navigation
information [4]. A visual navigation system (VNS), which consists of a built-in microcomputer
and a camera, is free from the above disadvantages, so it is advisable to include it in the
INC. Image processing is performed by an intelligent system based on convolutional neural
networks.

There are many ways to combine information from the navigation systems that make up the INC.
Designing an integration architecture is a trade-off: it seeks greater accuracy and robustness
of the navigation solution while minimizing complexity and optimizing the efficiency of
navigation information processing. The same trade-off should be taken into account for the
individual navigation systems that make up the INC.

In unfavorable cases, however, namely jamming of the satellite navigation system by external
interference, growth of the errors of the inertial navigation system, including those due to
accelerometers and gyroscopes made using MEMS technology, and bad weather conditions that
impede the operation of the VNS, interference must be suppressed to keep the navigation complex
operational.

To improve the accuracy of the INS, which is part of the INC, it is proposed to separate the
noise from the raw INS signal, predict it with neural networks and suppress it. The procedure
for separating the noise from the raw INS signal is considered in [5]. The gyroscope data were
processed as a time series, and the experiments used a real data set from a micromechanical
(MEMS) inertial measuring unit.


Problem Statement for predicting nonstationary processes

For a class of nonlinear nonstationary processes

$$\{X(k, \varepsilon)\} \in \mathbb{R}^{n},$$

where $E[X(k)] \neq \mathrm{const}$ and $\operatorname{var}[X(k)] \neq \mathrm{const}$, build
predictive models (neural networks)

$$x(k) = f[\theta, z(k), \varepsilon(k)],$$

where $\theta$ is the vector of model parameters; $k = 0, 1, 2, \ldots$ is discrete time;
$z(k)$ are the independent explanatory variables; $\varepsilon(k)$ is the random disturbing
process with arbitrary distribution.

Based on the model (in the form of a neural network), construct the prediction function

$$\hat{x}(k+s) = E_k[x(k+s) \mid x(k), x(k-1), \ldots, x(0), \hat{\varepsilon}(k), \ldots, \hat{\varepsilon}(0)],$$

where $\hat{x}(k+s)$ is a function that allows future values of the main (dependent) variable
to be calculated from the known historical data $\{x(k), x(k-1), \ldots, x(0)\}$ and the
estimates of the random process values
$\{\hat{\varepsilon}(k), \hat{\varepsilon}(k-1), \ldots, \hat{\varepsilon}(0)\}$; $s$ is the
number of forecasting steps.

Analyze the quality of the forecast estimates using statistical criteria such as the mean
squared error (MSE) and the mean absolute percentage error (MAPE):

$$\mathrm{MAPE} = \frac{1}{s} \sum_{i=1}^{s} \frac{|x_a(k+i) - \hat{x}(k+i)|}{|x_a(k+i)|} \cdot 100\%,$$

where $x_a(k+i)$ are the actual values of the main variable from the test sample and
$\hat{x}(k+i)$ are their estimates.
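
The two forecast quality criteria can be illustrated with a minimal sketch. The function names `mse` and `mape` and the sample values are hypothetical and chosen only for this illustration; the MAPE follows the definition above (mean of absolute relative errors over the forecast steps, in percent).

```python
def mse(actual, predicted):
    """Mean squared error over the forecast horizon."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def mape(actual, predicted):
    """Mean absolute percentage error, in percent."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    if any(a == 0 for a in actual):
        raise ValueError("MAPE is undefined when an actual value is zero")
    return 100.0 * sum(abs(a - p) / abs(a) for a, p in zip(actual, predicted)) / len(actual)


# Hypothetical test-sample values and their forecasts for s = 3 steps.
actual = [2.0, 4.0, 5.0]
predicted = [1.0, 5.0, 5.0]
print(mse(actual, predicted), mape(actual, predicted))
```

Note that MAPE is undefined when an actual value is zero, which matters for zero-mean sensor noise; the sketch raises an error in that case rather than guessing a convention.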


Predicting time series using neural 
networks 

Of the methods currently available for constructing a predictive model, the most flexible are
artificial neural networks (ANN) and the group method of data handling (GMDH). This work
proposes two approaches. The first combines ANN and GMDH, with the ANN used as the reference
function of the GMDH. The resulting algorithm combines the advantages of both: the gradual
increase in model complexity provided by GMDH and the ability of the ANN to learn. The second
approach is based on a hybrid recurrent neural network (RNN) that combines long short-term
memory (LSTM) and the gated recurrent unit (GRU) [6].


The use of the GMDH multilayer 
algorithm and single-layer networks with 
sigm_piecewise neurons 

To solve these problems, it is proposed to use the GMDH multilevel algorithm [7], which makes
it possible to:

— find a network structure that is optimal in a certain sense, under given restrictions on the
number of network parameters, much faster than by exhaustive search;

— automatically filter out "uninformative" variables.

The algorithm for the automatic synthesis of an optimal network structure with sigm_piecewise
neurons, based on the GMDH multilevel algorithm, consists of the following steps:

1) The input sample $\langle X, y: X \to \mathbb{R} \rangle$, where the set $X$ is a finite
subset of the space $\mathbb{R}^{n}$, $n \ge 2$, is divided in some way into two samples: a
training sample $\langle X^{tr}, y: X^{tr} \to \mathbb{R} \rangle$ and a validation sample
$\langle X^{val}, y: X^{val} \to \mathbb{R} \rangle$, with $X^{tr} \subset X$,
$X^{val} \subset X$, $X^{tr} \cap X^{val} = \emptyset$, $X^{tr} \cup X^{val} = X$. Usually,
about 70% of the patterns are placed in the training sample, and the simplest variant, which
works well in practice, is random selection.

2) $C_n^2$ neurons of the sigm_piecewise type are trained independently of one another. Each
neuron has the inputs $x_i, x_j$, $i, j = 1, \ldots, n$, $i < j$; that is, all possible pairs
of inputs are enumerated and a separate neuron is trained for each pair. The training sample is
used for learning, and the parameters of each neuron are tuned according to the criterion

$$E_{ij}(w_{+}, w_{-}, h) = \sum_{x \in X^{tr}} \left( y(x) - \mathrm{sigm\_piecewise}([x_i, x_j]^{T}; w_{+}, w_{-}, h) \right)^{2} \to \min_{w_{+}, w_{-}, h}.$$

3) The value of the "external" criterion is calculated for each neuron. The most common
criterion is the error (MSE) of the model on the validation sample:

$$C_{ij}(w_{+}, w_{-}, h) = \sum_{x \in X^{val}} \left( y(x) - \mathrm{sigm\_piecewise}([x_i, x_j]^{T}; w_{+}, w_{-}, h) \right)^{2}.$$
4) Of the $C_n^2$ trained neurons, the $\lfloor \alpha C_n^2 \rfloor$, $\alpha \in (0, 1)$,
neurons with the worst external criterion values are discarded. The value of the parameter
$\alpha$ is selected depending on the limit on the number of network parameters or, if there is
no such restriction, according to some heuristic. For example, if the input variables include
variables that are completely unrelated to the predicted variable, then the external criterion
value for the neurons that used these variables as inputs is usually much worse than the value
for the neurons that used the "informative" variables (Fig. 1).


Fig. 1. An example of the values of external criterion 
"model error on validation sample" of different neurons 
with the presence of "uninformative" variables 
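
The selection steps above can be sketched in a few lines. This is only an illustration under stated assumptions: the sigm_piecewise neuron is not defined in this section, so the sketch swaps in an ordinary least-squares fit on each input pair as a stand-in; the function names, the toy data and the values of `alpha` and `train_fraction` are hypothetical. It requires NumPy.

```python
from itertools import combinations

import numpy as np


def fit_pair_model(x_pair, y):
    """Stand-in for training one neuron on two inputs (NOT sigm_piecewise):
    ordinary least squares on the columns [x_i, x_j, 1]."""
    design = np.column_stack([x_pair, np.ones(len(x_pair))])
    coeffs, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coeffs


def predict_pair_model(coeffs, x_pair):
    design = np.column_stack([x_pair, np.ones(len(x_pair))])
    return design @ coeffs


def select_pairs(x, y, alpha=0.5, train_fraction=0.7, seed=0):
    """Random split, one model per input pair, external criterion on the
    validation part, then discard the worst floor(alpha * C(n, 2)) models."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(x))
    n_train = int(train_fraction * len(x))
    train, val = order[:n_train], order[n_train:]

    scores = {}
    for i, j in combinations(range(x.shape[1]), 2):
        cols = [i, j]
        coeffs = fit_pair_model(x[train][:, cols], y[train])
        residual = y[val] - predict_pair_model(coeffs, x[val][:, cols])
        scores[(i, j)] = float(np.sum(residual ** 2))   # external criterion

    ranked = sorted(scores, key=scores.get)             # best pair first
    n_discard = int(np.floor(alpha * len(ranked)))
    return ranked[: len(ranked) - n_discard], scores


# Toy data: y depends on columns 0 and 1; columns 2 and 3 are uninformative.
data_rng = np.random.default_rng(1)
x = data_rng.normal(size=(200, 4))
y = 2.0 * x[:, 0] - 3.0 * x[:, 1] + 0.01 * data_rng.normal(size=200)
kept, scores = select_pairs(x, y, alpha=0.5)
print(kept)
```

On such data the pair built only from uninformative columns receives a much worse external criterion value and is among the discarded models, which is the filtering effect described in step 4.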


Another approach to predicting MEMS sensor noise is to use recurrent neural networks, in
particular long short-term memory (LSTM), gated recurrent units (GRU) and a hybrid neural
network consisting of LSTM and GRU [8].

Let us first consider the LSTM and GRU networks separately.


Description of LSTM 

The LSTM was proposed to solve the long-term dependence problem of the conventional RNN. It is
a modified RNN architecture characterized by the existence of a cell state (Fig. 2). The LSTM
is composed of three gates, namely the input, forget and output gates. Each gate controls its
value through a sigmoid layer and pointwise multiplication [8].


Fig. 2. Long short-term memory network architecture


The input gate ($i_t$) uses a sigmoid function ($\sigma$) to determine which of the new
information is to be stored in the cell state. Next, the tanh function creates new candidate
values, the cell candidate state $\tilde{c}_t$.

The forget gate ($f_t$) determines how much of the previous state is forgotten: the state is
updated by multiplying the previous state, $c_{t-1}$, by the output value of the forget gate.
In addition, the candidate values are multiplied by the output of the input gate and added to
the cell state $c_t$.

The output gate ($o_t$) determines the output value and also uses a sigmoid function. The final
output value is calculated by multiplying the value obtained by applying tanh to the cell state
($c_t$) by the sigmoid output value ($o_t$).

The LSTM unit is the basic component of an LSTM architecture; it is a series of gates and cells
that work together to produce a final result. The forward pass of an LSTM unit is modeled by
the following equations [9]:


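$$f_t = \sigma(W_f x_t + U_f h_{t-1} + b_f), \qquad (1)$$
$$i_t = \sigma(W_i x_t + U_i h_{t-1} + b_i), \qquad (2)$$
$$\tilde{c}_t = \tanh(W_c x_t + U_c h_{t-1} + b_c), \qquad (3)$$
$$c_t = f_t * c_{t-1} + i_t * \tilde{c}_t, \qquad (4)$$
$$o_t = \sigma(W_o x_t + U_o h_{t-1} + b_o), \qquad (5)$$
$$h_t = o_t * \tanh(c_t), \qquad (6)$$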


where $\sigma$ is the sigmoid function, $f_t$ is the forget gate activation vector, $i_t$ is
the input gate activation vector, $o_t$ is the output gate activation vector, $\tilde{c}_t$ is
the cell input activation vector, $c_t$ is the cell state, $h_t$ is the output vector of the
LSTM unit, all $W$ and $U$ are weights, $b$ are bias vectors and the symbol * denotes the
Hadamard (element-wise) product. The weights $W$, $U$ and the biases $b$ are learned during
training. To interpret these equations, start with the cell state $c_t$, which contains two
kinds of information. The first is the old information kept from the past state $c_{t-1}$; it
is selected by the forget gate, which decides the share of information to keep by computing a
value between 0 (discard completely) and 1 (keep completely). The second is the new information
to include in the cell state, calculated from the input gate $i_t$ and the cell input
activation $\tilde{c}_t$, which are computed using (2) and (3), respectively. Finally, the
output value is calculated in two steps. A potential value is first calculated using (5). This
value is then regulated using the information present in the cell state, as indicated in (6).
The use of the cell state in the final calculation makes the LSTM powerful in tasks where
information must be stored and used later (long term).
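
As a minimal sketch, one forward step of an LSTM unit following equations (1)-(6) can be written with NumPy. The parameter dictionary layout, the function names, the random initialization scale and the toy input sequence are assumptions made for this illustration and are not taken from [9].

```python
import numpy as np


def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))


def init_lstm_params(n_in, n_hidden, rng):
    """Small random weights W_g, U_g and zero biases b_g for g in f, i, c, o."""
    params = {}
    for g in "fico":
        params["W_" + g] = 0.1 * rng.standard_normal((n_hidden, n_in))
        params["U_" + g] = 0.1 * rng.standard_normal((n_hidden, n_hidden))
        params["b_" + g] = np.zeros(n_hidden)
    return params


def lstm_step(x_t, h_prev, c_prev, p):
    """One forward step of an LSTM unit, equations (1)-(6)."""
    f_t = sigmoid(p["W_f"] @ x_t + p["U_f"] @ h_prev + p["b_f"])      # (1)
    i_t = sigmoid(p["W_i"] @ x_t + p["U_i"] @ h_prev + p["b_i"])      # (2)
    c_tilde = np.tanh(p["W_c"] @ x_t + p["U_c"] @ h_prev + p["b_c"])  # (3)
    c_t = f_t * c_prev + i_t * c_tilde                                 # (4)
    o_t = sigmoid(p["W_o"] @ x_t + p["U_o"] @ h_prev + p["b_o"])      # (5)
    h_t = o_t * np.tanh(c_t)                                           # (6)
    return h_t, c_t


rng = np.random.default_rng(0)
params = init_lstm_params(n_in=3, n_hidden=4, rng=rng)
h, c = np.zeros(4), np.zeros(4)
for x_t in rng.standard_normal((10, 3)):   # toy sequence: 10 samples, 3 channels
    h, c = lstm_step(x_t, h, c, params)
print(h)
```

With all weights and biases set to zero every gate equals 0.5 and the candidate is zero, so the step simply halves the cell state; this is a convenient sanity check of an implementation.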


Description of GRU 

Although the LSTM has an excellent 
ability to memorize long-term dependencies, 
it takes a long time to train the model owing 
to its complex structure. Thus, the GRU was 
proposed to accelerate the training process 
[10]. 

The GRU is a type of RNN framework 
with a gate mechanism applied, inspired by 
LSTM, and with a simpler structure [11]. The 
GRU architecture is described in Fig. 3. 


Fig. 3. Gated recurrent unit architecture 


The reset gate ($r_t$) uses the sigmoid function to decide how much of the past information is
reset: its output, a value in (0, 1), is multiplied by the previous hidden state. The gate
value itself is obtained by multiplying the hidden state at the previous time step and the
input at the present time step by their weights.

The update gate combines the roles of the forget and input gates of the LSTM model. It
determines the proportion in which past and present information is updated. The sigmoid output
of the update gate, $z_t$, determines the amount of present information, and the value
$(1 - z_t)$ is multiplied by the information of the hidden layer at the most recent time step.

The forward pass of a GRU unit is modeled by the following equations [9]:

$$z_t = \sigma(W_z x_t + U_z h_{t-1} + b_z), \qquad (7)$$
$$r_t = \sigma(W_r x_t + U_r h_{t-1} + b_r), \qquad (8)$$
$$\tilde{h}_t = \tanh(W_h x_t + U_h (r_t * h_{t-1}) + b_h), \qquad (9)$$
$$h_t = (1 - z_t) * h_{t-1} + z_t * \tilde{h}_t, \qquad (10)$$


where $\sigma$ is the sigmoid function, $z_t$ is the update gate activation vector, $r_t$ is
the reset gate activation vector, $\tilde{h}_t$ is the candidate vector and $h_t$ is the output
vector of the GRU unit. $W$ and $U$ are weights, $b$ are bias vectors and the symbol * denotes
the Hadamard product. As for the LSTM, the weights and biases are learned during training.
Consider how a GRU unit works according to these equations. First, the update gate is computed
from the input vector $x_t$, the output of the previous unit $h_{t-1}$ and the sigmoid
activation function. The reset gate is calculated in the same way as the update gate, using its
own weights and bias. The reset gate then enters the calculation of the candidate value, where
it determines how much information from the previous state is preserved. Indeed, equation (9)
shows that when $r_t$ is very close or equal to zero, only the input value is considered in the
candidate value calculation. Finally, the output value is calculated by balancing the previous
output against the new candidate output. This balance is set by the update gate $z_t$
($z_t = 0$ copies the previous output; $z_t = 1$ generates a new output regardless of the old
output). There is a certain similarity between the LSTM and the GRU: both implement an
intermediate gating mechanism that is used later in the output value calculation.

In general, both LSTM and GRU are powerful and well suited to sequential data. The GRU has the
benefit of faster optimization than the LSTM because it has fewer parameters.
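
The GRU forward step of equations (7)-(10) and the parameter-count comparison can be sketched as follows. The parameter dictionary layout, the function names and the sizes used are assumptions for illustration only; the count assumes that each gate block holds one input weight matrix, one recurrent weight matrix and one bias vector, as in the equations.

```python
import numpy as np


def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))


def init_gru_params(n_in, n_hidden, rng):
    """Small random weights W_g, U_g and zero biases b_g for g in z, r, h."""
    params = {}
    for g in "zrh":
        params["W_" + g] = 0.1 * rng.standard_normal((n_hidden, n_in))
        params["U_" + g] = 0.1 * rng.standard_normal((n_hidden, n_hidden))
        params["b_" + g] = np.zeros(n_hidden)
    return params


def gru_step(x_t, h_prev, p):
    """One forward step of a GRU unit, equations (7)-(10)."""
    z_t = sigmoid(p["W_z"] @ x_t + p["U_z"] @ h_prev + p["b_z"])              # (7)
    r_t = sigmoid(p["W_r"] @ x_t + p["U_r"] @ h_prev + p["b_r"])              # (8)
    h_tilde = np.tanh(p["W_h"] @ x_t + p["U_h"] @ (r_t * h_prev) + p["b_h"])  # (9)
    return (1.0 - z_t) * h_prev + z_t * h_tilde                                # (10)


def count_params(n_in, n_hidden, n_gate_blocks):
    """Each gate block has W (n_hidden x n_in), U (n_hidden x n_hidden) and b."""
    return n_gate_blocks * (n_hidden * n_in + n_hidden * n_hidden + n_hidden)


rng = np.random.default_rng(0)
gru_params = init_gru_params(n_in=3, n_hidden=4, rng=rng)
h = np.zeros(4)
for x_t in rng.standard_normal((10, 3)):   # toy sequence: 10 samples, 3 channels
    h = gru_step(x_t, h, gru_params)

gru_count = count_params(3, 4, n_gate_blocks=3)    # z, r, h
lstm_count = count_params(3, 4, n_gate_blocks=4)   # f, i, c, o
print(gru_count, lstm_count)
```

For equal input and hidden sizes the GRU therefore has three quarters of the parameters of the LSTM, which is consistent with its faster optimization.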


Hybrid Neural Structure of LSTM 
and GRU 

In practical applications, multilayer LSTM or GRU networks are generally used instead of
single-layer ones, although with no more than three layers [6]. The two-layer hybrid structure
is shown in Fig. 4. Here the input layer is an LSTM and the output layer is a GRU, which gives
a heterogeneous network abbreviated as LSTM-GRU; in Fig. 5 the input layer is a GRU and the
output layer is an LSTM, which gives the other heterogeneous network, referred to as GRU-LSTM.
Paper [6] presents four hybrid deep RNN models, each composed of four parts. The first part is
the initial state, which is initialized randomly, and the second part feeds the sequence data
to the network as input. The third part is the collection of hidden states in each LSTM or GRU
layer, and the last part produces the network prediction output. The output can be taken from
the state at the last step, from a weighting of the states at all previous steps, or from their
direct average.
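
A two-layer LSTM-GRU stack of the kind described above can be sketched with NumPy. This is an untrained illustration, not the model of [6]: the layer sizes, the random weights, the zero initial states (used here for reproducibility), the linear readout from the last hidden state and all names are assumptions.

```python
import numpy as np


def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))


def make_gates(names, n_in, n_hidden, rng):
    """Random small (W, U, b) triple for each named gate block."""
    return {g: (0.1 * rng.standard_normal((n_hidden, n_in)),
                0.1 * rng.standard_normal((n_hidden, n_hidden)),
                np.zeros(n_hidden)) for g in names}


def affine(p, gate, x, h):
    w, u, b = p[gate]
    return w @ x + u @ h + b


def lstm_layer(xs, p, n_hidden):
    """Run an LSTM over the sequence xs and return all hidden states."""
    h, c, out = np.zeros(n_hidden), np.zeros(n_hidden), []
    for x in xs:
        f = sigmoid(affine(p, "f", x, h))
        i = sigmoid(affine(p, "i", x, h))
        c_tilde = np.tanh(affine(p, "c", x, h))
        o = sigmoid(affine(p, "o", x, h))
        c = f * c + i * c_tilde
        h = o * np.tanh(c)
        out.append(h)
    return np.array(out)


def gru_layer(xs, p, n_hidden):
    """Run a GRU over the sequence xs and return all hidden states."""
    h, out = np.zeros(n_hidden), []
    for x in xs:
        z = sigmoid(affine(p, "z", x, h))
        r = sigmoid(affine(p, "r", x, h))
        w, u, b = p["h"]
        h_tilde = np.tanh(w @ x + u @ (r * h) + b)
        h = (1.0 - z) * h + z * h_tilde
        out.append(h)
    return np.array(out)


def lstm_gru_predict(xs, lstm_p, gru_p, w_out, n_hidden):
    """LSTM input layer, GRU output layer, linear readout of the last state."""
    hidden = gru_layer(lstm_layer(xs, lstm_p, n_hidden), gru_p, n_hidden)
    return float(w_out @ hidden[-1])


rng = np.random.default_rng(0)
n_hidden = 8
lstm_p = make_gates("fico", 1, n_hidden, rng)
gru_p = make_gates("zrh", n_hidden, n_hidden, rng)
w_out = 0.1 * rng.standard_normal(n_hidden)
signal = np.sin(np.linspace(0.0, 3.0, 20)).reshape(-1, 1)   # toy sequence, 20 steps
prediction = lstm_gru_predict(signal, lstm_p, gru_p, w_out, n_hidden)
print(prediction)
```

Swapping the order of the two layer functions gives the GRU-LSTM variant; replacing `hidden[-1]` with a weighted sum or the mean of `hidden` corresponds to the other output options mentioned above.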

As mentioned earlier, the third 
component of the INC is the visual navigation 
system, which uses the processing of images 
obtained from a video camera to determine 
the current coordinates of the UAV. In case of 
bad weather conditions, the images obtained 
from the video camera will be of bad quality, 
which ultimately will lead to significant errors 
in determining the coordinates of the UAV. 
To overcome these difficulties, it is necessary 
to perform image noise reduction. 


Fig. 4. The structure of LSTM-GRU (diagram labels: initial state, sequence data input, predicted output)


Fig. 5. The structure of GRU-LSTM (diagram label: predicted output)


Deep Learning on Image Denoising

At present, the most promising approach to image denoising is deep learning neural networks.

It is necessary to consider deep convolutional neural networks (CNNs) for additive white noisy
images, deep CNNs for real noisy images, deep CNNs for blind denoising and deep CNNs for hybrid
noisy images, which combine noisy, blurred and low-resolution images. Among deep learning
networks for image noise reduction, convolutional neural networks are the most promising
[12, 13].

However, because of the following drawbacks, they were not at first widely applied in computer
systems [15]. Firstly, deep CNNs can suffer from vanishing gradients. Secondly, activation
functions such as sigmoid [16] and tanh [17] resulted in a high computational cost. Thirdly,
the hardware platforms did not support complex networks. This situation was changed by AlexNet
in the 2012 ImageNet Large-Scale Visual Recognition Challenge (LSVRC) [15]. After that, deep
network architectures (e.g. VGG [18] and GoogLeNet [19]) were widely applied in image [20, 21],
video [22, 23], natural language processing [24] and speech processing [25], and especially in
low-level computer vision [26, 27]. Deep networks were first applied to image denoising in 2015
[28, 29]. The proposed network did not require parameters to be set manually for removing the
noise. Deep networks were subsequently widely applied in speech [30] and image restoration
[31]. Mao et al. [31] used multiple convolutions and deconvolutions to suppress the noise and
recover the high-resolution image. To address multiple low-level tasks with one model, a
denoising CNN (DnCNN) [32] consisting of convolutions, batch normalization (BN) [33], rectified
linear units (ReLU) [34] and residual learning (RL) [35] was proposed to deal with image
denoising, super-resolution and JPEG image deblocking. To balance denoising performance and
speed, a color non-local network (CNL-Net) [36] combined non-local self-similarity (NLSS) and a
CNN to efficiently remove color-image noise.

In terms of blind denoising, a fast and flexible denoising CNN (FFDNet) [37] took different
noise levels and the noisy image patch as the input of the denoising network to improve
denoising speed and handle blind denoising.


Problem Statement of image denoising
A given noisy image is modeled as

$$y = x + \mu,$$

where $x$, $y$ and $\mu$ represent the clean image, the noisy image and additive white Gaussian
noise (AWGN) of standard deviation $\sigma$, respectively. The purpose of image noise reduction
is to obtain the clean image $x$ from the degraded version $y$.

Consider the following as additive 
image noises: Gaussian noise, salt and pepper 
noise, Poisson noise, shot noise. Consider 
each of the above noises separately. 


Gaussian noise 

The main sources of Gaussian noise in 
digital images occur during acquisition, such 
as sensor noise caused by poor lighting and/or 
high temperature, and/or transmission, such as 
electronic circuit noise. The standard model 
of this noise is additive, independent at each 
pixel, and independent of signal intensity, 
primarily caused by Johnson-Nyquist noise 
(thermal noise), including capacitor reset 
noise. Amplifier noise is the main part of the 
image sensor's read noise, i.e. constant noise 
level in dark areas of the image. In color 
cameras, where more gain is used in the blue 
channel than in the green or red channel, there 
may be more noise in the blue channel. 


Salt-and-pepper noise

The common "fat tail" or "impulsive" 
noise is sometimes referred to as salt and 
pepper noise or spike noise. An image 
containing salt and pepper noise will have 
dark pixels in the light areas and bright pixels 
in the dark areas. This type of noise can be 
caused by A/D converter errors, bit 
transmission errors, etc. This can be mostly 
eliminated using dark frame subtraction and 
interpolation around dark/bright pixels. 


Poisson noise 
Poisson noise, or shot noise, is a type of electronic noise that occurs when the finite number
of energy-carrying particles, such as electrons in an electronic circuit or photons in an
optical device, is small enough to give rise to detectable statistical fluctuations in a
measurement.


Shot noise


The predominant noise in the lighter parts of an image from an image sensor is usually caused
by statistical quantum fluctuations, i.e. by variation in the number of photons sensed at a
given exposure level. This noise is known as photon shot noise. Shot noise has an RMS value
proportional to the square root of the image intensity, and the noise in different pixels is
independent. Shot noise follows a Poisson distribution, which is usually not very different
from a Gaussian.
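
The noise types described above can be simulated on a test image with NumPy. This is a sketch for generating synthetic data only; the function names, the flat gray test image, the intensity range [0, 1] and the parameter values (`sigma`, `fraction`, `photons_at_white`) are assumptions for illustration.

```python
import numpy as np


def add_gaussian_noise(image, sigma, rng):
    """Additive, signal-independent noise with standard deviation sigma."""
    return image + rng.normal(0.0, sigma, size=image.shape)


def add_salt_and_pepper(image, fraction, rng):
    """Set a random share `fraction` of pixels to 0 (pepper) or 1 (salt)."""
    noisy = image.copy()
    corrupt = rng.random(image.shape) < fraction
    salt = rng.random(image.shape) < 0.5
    noisy[corrupt & salt] = 1.0
    noisy[corrupt & ~salt] = 0.0
    return noisy


def add_shot_noise(image, photons_at_white, rng):
    """Photon shot noise: the count in each pixel is Poisson distributed
    with a mean proportional to the pixel intensity."""
    counts = rng.poisson(image * photons_at_white)
    return counts / photons_at_white


rng = np.random.default_rng(0)
clean = np.full((64, 64), 0.5)        # flat mid-gray test image in [0, 1]
gaussian = add_gaussian_noise(clean, sigma=0.1, rng=rng)
impulsive = add_salt_and_pepper(clean, fraction=0.05, rng=rng)
shot = add_shot_noise(clean, photons_at_white=100.0, rng=rng)
print(gaussian.std(), (impulsive != 0.5).mean(), shot.std())
```

For the shot noise the standard deviation in intensity units is sqrt(intensity / photons_at_white), which reflects the square-root dependence on intensity mentioned above.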


Statement of the problem in the 
presence of blind noise 

The image is often distorted because of the imperfection of optical devices, defocusing, the
influence of the medium between the object and the device, and a number of other reasons. The
problem of image restoration is, as a rule, ill-posed (highly unstable), and stable methods
must be used to solve it. Its solution requires quite diverse and complex tools, including
mathematical modeling, statistical estimation theory, the Fourier transform, optimization
methods and numerous linear algebra methods. We will consider only grayscale images, under the
assumption that to process a full-color image it is enough to repeat all the necessary steps
for each of the RGB color channels. Let us introduce the following notation: $f(x, y)$ is the
original undistorted image, $h(x, y)$ is the distortion function, $n(x, y)$ is the additive
noise, and $g(x, y)$ is the result of distortion, i.e. what we observe (a blurry or
out-of-focus image). We formulate the model of the distortion process as follows:

$$g(x, y) = h(x, y) * f(x, y) + n(x, y),$$

where * here denotes two-dimensional convolution.

The problem of reconstructing a distorted image is to find the best approximation
$\hat{f}(x, y)$ of the original image. The function $h(x, y)$ determines how each pixel of the
original image is distorted (transformed): into a spot in the case of defocusing and into a
segment in the case of simple blurring. Conversely, each pixel of the distorted image is
assembled from the pixels of some neighborhood of the original image. These contributions are
superimposed on one another, and the result is a degraded image.
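
The distortion model can be illustrated with a short NumPy sketch in which a single bright pixel is smeared into a segment by a horizontal blur kernel. The function names, the zero-padding at the borders, the Gaussian form of the additive noise and the kernel itself are assumptions chosen for this illustration.

```python
import numpy as np


def convolve2d_same(image, kernel):
    """Direct 2-D convolution with zero padding; odd-sized kernels only."""
    kh, kw = kernel.shape
    if kh % 2 == 0 or kw % 2 == 0:
        raise ValueError("kernel sides must be odd")
    rows, cols = image.shape
    padded = np.pad(image, ((kh // 2, kh // 2), (kw // 2, kw // 2)))
    flipped = kernel[::-1, ::-1]          # convolution flips the kernel
    out = np.zeros((rows, cols))
    for dy in range(kh):
        for dx in range(kw):
            out += flipped[dy, dx] * padded[dy:dy + rows, dx:dx + cols]
    return out


def degrade(f, h, noise_sigma, rng):
    """g = h * f + n, with n drawn as white Gaussian noise (an assumption)."""
    return convolve2d_same(f, h) + rng.normal(0.0, noise_sigma, size=f.shape)


# Simple horizontal blur: each pixel is smeared into a 5-pixel segment.
segment = np.zeros((5, 5))
segment[2, :] = 1.0 / 5.0

rng = np.random.default_rng(0)
f = np.zeros((9, 9))
f[4, 4] = 1.0                             # a single bright pixel
g_clean = degrade(f, segment, noise_sigma=0.0, rng=rng)
g_noisy = degrade(f, segment, noise_sigma=0.05, rng=rng)
print(np.round(g_clean[4], 2))
```

In blind restoration neither `h` nor the noise level is known in advance, which is what makes the inverse problem unstable.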


Deep learning techniques for additive 
white noisy-image denoising 

Because real noisy images are scarce, additive white noisy images (AWNI) have been widely used
to train denoising models [38]. AWNI include Gaussian, Poisson, salt, pepper and multiplicative
noisy images [39]. There are many deep learning techniques for AWNI denoising, namely CNN/NN,
the combination of CNN/NN with common feature extraction methods, and the combination of an
optimization method with CNN/NN [3, 40-51].


The combination of optimization 
method and CNN/NN for AWNI denoising 
Machine learning uses optimization techniques [52, 53] and discriminative learning methods
[54, 55] to deal with image applications in general. Although optimization methods perform well
on different low-level vision tasks, they need manually set parameters, which is
time-consuming. Discriminative learning methods are fast in image restoration, but they are not
flexible across low-level vision tasks. To balance efficiency and flexibility, a discriminative
learning optimization-based method [56] was presented for image applications such as image
denoising. A CNN with prior knowledge introduced through the regularization term of the loss
function is a common method in image denoising [57]. Such methods fall into two main
categories:
1) improvement of denoising speed;
2) improvement of denoising performance.
To improve denoising speed, combining an optimization method with a CNN is a good tool for
rapidly finding the optimal solution in image denoising [58]. For example, a GAN with maximum a
posteriori (MAP) estimation was used to estimate the noise and deal with other tasks, such as
image inpainting and super-resolution [59]. Experience-based greedy and transfer learning
strategies with a CNN can accelerate a genetic algorithm to obtain a clean image [60]. Using
the noisy image and a noise level map as the inputs of a CNN gave faster execution in
predicting the noise [61].

To improve denoising performance, a CNN was combined with optimization methods to make the
noisy image smooth [62].

Real applications of deep learning for image denoising focus on two kinds of methods: a single
end-to-end CNN and the combination of prior knowledge with a CNN.

For the first kind, changing the network architecture is a popular way to remove the noise from
a given real corrupted image. Multiscale processing is very effective for image denoising. For
example, a CNN comprising convolution, ReLU and RL employed different phase features to enhance
the expressive ability of a low-light image denoising model [63]. To overcome blur and false
image artifacts, a dual U-Net with skip connections was proposed for CT image reconstruction
[64]. To address the resource-constraint problem, Tian et al. [65] used a dual CNN with batch
renormalization [66], RL and dilated convolutions to deal with real noisy images. Based on the
nature of light images, two CNNs utilized anisotropic parallax analysis to generate structural
parallax information for real noisy images [67]. Additionally, using CNNs to process remote
sensing [68] and medical images under low-light conditions is very effective. To extract more
detailed information, recurrent connections were used to enhance the representational ability
for dealing with corrupted images in the real world [69]. To deal with unknown real noisy
images, a residual structure was utilized to facilitate low-frequency features, after which an
attention mechanism can be applied to extract more potential features from channels. From the
point of view of how the noisy image is produced, imitating camera pipelines to construct the
degradation model was very effective for filtering real noise [71].
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Deep learning techniques for blind 
denoising 

In the real world, images are easily corrupted and the noise is complex.
Thus, blind denoising techniques are important [72]. At first, FFDNet
[37] used the noise level map and the noisy image as the input of a CNN
to train a denoiser for unknown noisy images. Since then, many methods
have been proposed to solve the blind denoising problem. Based on the
mechanism of the imaging device, Kenzo et al. [70] utilized soft
shrinkage to adjust the noise level for blind denoising. For unpaired
noisy images, using CNNs to estimate the noise became a good tool [73].
Yang et al. [74] used a known noise level to train a denoiser and then
utilized this denoiser to estimate the noise level. For the random noise
attenuation problem, a CNN with RL was used to filter complex noise
[75, 76]. Additionally, changing the network architecture can improve
blind denoising performance.
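
For reference, the standard soft shrinkage (soft thresholding) operator can be sketched as below. This shows only the generic operator, not the procedure of [70]; the function name `soft_shrink` and the threshold value are illustrative assumptions.

```python
def soft_shrink(x, threshold):
    """Standard soft shrinkage: move x toward zero by `threshold`.

    Values with magnitude at or below the threshold become exactly zero.
    """
    if x > threshold:
        return x - threshold
    if x < -threshold:
        return x + threshold
    return 0.0


# Small coefficients (likely noise) are zeroed, large ones are shrunk.
coeffs = [-1.5, -0.2, 0.0, 0.3, 2.0]
shrunk = [soft_shrink(c, 0.5) for c in coeffs]
print(shrunk)
```

A larger threshold removes more of the small-amplitude content, which is why the threshold can act as a handle on the assumed noise level.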

In this work, to solve the problem of noise reduction in images obtained
from the video camera of a visual navigation system, the following
criterion was used:

$$ l_i(\hat{I}, I_{gt}) = \frac{1}{hwd} \left\| \phi_i(\hat{I}) - \phi_i(I_{gt}) \right\| , $$

where $\hat{I}$ and $I_{gt}$ are the denoised image and the corresponding
noise-free image, and $\{\phi_i\}_{i=1}^{N}$ denote N different maps of
image features.
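
The criterion above compares feature maps of the processed image with those of the noise-free image and normalizes by the feature map size h·w·d. A minimal sketch is given below, under two assumptions that are not stated in the text: a squared Euclidean norm is used, and each feature map is stored as a d x h x w nested list. The function name `feature_loss` is hypothetical, and the feature extractor itself is left out.

```python
def feature_loss(feat_denoised, feat_clean):
    """Squared difference of two d x h x w feature maps, divided by h*w*d.

    Both arguments are nested lists with identical shapes; they stand in
    for the outputs of one feature map applied to the two images.
    """
    d = len(feat_denoised)
    h = len(feat_denoised[0])
    w = len(feat_denoised[0][0])
    total = 0.0
    for ch_a, ch_b in zip(feat_denoised, feat_clean):
        for row_a, row_b in zip(ch_a, ch_b):
            for a, b in zip(row_a, row_b):
                total += (a - b) ** 2
    return total / (h * w * d)


# Toy 1 x 2 x 2 feature maps that differ in a single position by 2.0.
features_a = [[[1.0, 2.0], [3.0, 4.0]]]
features_b = [[[1.0, 2.0], [3.0, 2.0]]]
print(feature_loss(features_a, features_b))
```

With N feature maps, the same function would be evaluated once per map and the values combined, for example by summation.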

In this paper, we tackle this issue by developing a hybrid convolutional
neural network: a blind denoising network (CBDNet) for real-world video.
CBDNet achieves good denoising results by retaining most structure and
details while removing sophisticated real-world noise. To sum up, the
contributions of this work are as follows:

— synthetic noisy images and real noisy 
video are incorporated to better characterize real-world image noise and
improve denoising performance;

— benefited from the introduction of 
a noise estimation subnetwork, an asymmetric loss is suggested to improve
the generalization ability to real noise, and interactive denoising is
allowed by adjusting the noise level map.
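
The asymmetric loss on the noise estimate can be illustrated as follows. The form used here, the sum over pixels of |alpha - 1[est - true < 0]| · (est - true)^2 with 0 < alpha < 0.5, follows the CBDNet description [77] as we understand it. The function name `asymmetric_loss`, the flat-list layout and the value alpha = 0.3 are assumptions made for this sketch.

```python
def asymmetric_loss(sigma_est, sigma_true, alpha=0.3):
    """Asymmetric squared error between estimated and true noise levels.

    With alpha < 0.5, under-estimation (est < true) receives the larger
    weight (1 - alpha), over-estimation the smaller weight alpha.
    """
    total = 0.0
    for est, true in zip(sigma_est, sigma_true):
        diff = est - true
        under = 1.0 if diff < 0 else 0.0  # indicator of under-estimation
        total += abs(alpha - under) * diff ** 2
    return total


# The same absolute error costs more when the noise is under-estimated.
loss_under = asymmetric_loss([0.0], [1.0])
loss_over = asymmetric_loss([2.0], [1.0])
print(loss_under > loss_over)
```

Penalizing under-estimation more strongly biases the estimator toward slightly larger noise levels, which tends to be the safer error for a subsequent non-blind denoiser.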




Blind denoising of real noisy images is generally more challenging and
can involve two stages, i.e., noise estimation and non-blind denoising.
In most blind denoising methods, noise estimation is closely coupled with
non-blind denoising. CBDNet [77] consists of a noise estimation
subnetwork CNN_E and a non-blind denoising subnetwork CNN_D. The output
of CNN_E is taken to be the noise level map $\hat{\sigma}(y)$, because it
has the same size as the input y and can be estimated with a fully
convolutional network. Then, CNN_D takes both y and $\hat{\sigma}(y)$ as
input to obtain the final denoising result

$$ \hat{x} = F_D(y, \hat{\sigma}(y); W_D), $$

where W_D denotes the network parameters of CNN_D [77].
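
The two-stage structure described above (noise estimation followed by non-blind denoising on the image and its noise level map) can be sketched as a simple composition. Everything named here is hypothetical: `toy_estimate_noise` and `toy_denoise` are hand-written stand-ins for the trained noise estimation and denoising subnetworks, and the scaling factor `gamma` is one assumed way to adjust the noise level map for interactive denoising.

```python
def blind_denoise(y, estimate_noise, denoise, gamma=1.0):
    """Estimate a noise level map, optionally rescale it, then denoise."""
    sigma_map = estimate_noise(y)
    # Interactive adjustment: scale the estimated noise level map.
    sigma_map = [[gamma * s for s in row] for row in sigma_map]
    return denoise(y, sigma_map)


def toy_estimate_noise(y):
    # Stand-in for the noise estimation subnetwork: a constant map of y's size.
    return [[0.5 for _ in row] for row in y]


def toy_denoise(y, sigma_map):
    # Stand-in for the denoising subnetwork: pull each pixel toward the
    # global mean, more strongly where the noise level is higher.
    pixels = [p for row in y for p in row]
    mean = sum(pixels) / len(pixels)
    out = []
    for row, s_row in zip(y, sigma_map):
        out.append([p + min(max(s, 0.0), 1.0) * (mean - p)
                    for p, s in zip(row, s_row)])
    return out


noisy = [[0.0, 1.0], [1.0, 0.0]]
print(blind_denoise(noisy, toy_estimate_noise, toy_denoise))
print(blind_denoise(noisy, toy_estimate_noise, toy_denoise, gamma=0.0))
```

Setting `gamma` to zero leaves the toy image unchanged, while larger values smooth it more strongly; this mirrors how adjusting the noise level map trades detail preservation against noise removal.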


Results 

The hybrid convolutional neural network CBDNet was tested on video images
of cars in bad weather conditions (fog, light rain), using a sample of
100 images. The result of image processing is shown in Fig. 6.


Fig. 6. The result of car video image processing in bad weather conditions


Conclusions 

The paper proposes approaches to improving the accuracy of integrated
navigation systems by applying noise reduction methods to the inertial
navigation and visual navigation systems. The problem is solved by using
hybrid recurrent neural networks and hybrid convolutional neural
networks.